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PATENT APPLICATION 

SPHINGOSINE 1-PHOSPHATE RECEPTOR GENE. SPPR 
Inventors: Thomas P. Loughran, Jr. & Ravi Kothapalli 

FIELD OF THE INVENTION 

5 The present invention relates to the genetics of autoimmune diseases, including 

lymphoproliferative diseases, such as large granular lymphocyte leukemia (LGL), and 
rheumatoid arthritis (RA). Specifically, the invention relates to a novel sphingosine 1- 
phosphate receptor gene, herein termed sppr, and its splice variants. Sppr is up-regulated in 
LGL and is useful, for example, in the diagnosis and treatment of certain lymphoproliferative, 
10 neurodegenerative and autoiimnune diseases. 

BACKGROUND OF THE INVENTION 

Large granular Ijonphocjrte leukemia (LGL) is a rare form of lymphoproliferative 
disorder often associated with autoimmune disease (Loughran T. P., Clonal diseases of large 
granular lymphocytes. Blood 82, 1-14, 1993). 

15 The cause of LGL is still not fiilly understood. An increased count of large granular 

lymphocytes is characteristic of LGL leukemia. Patients with clonal CD3+LGL, as 
determined by cytogenetic or T-cell receptor (TCR) gene rearrangement studies, are 
classified as T-LGL. Some of these patients may resemble those with Felty's syndrome with 
clinical features of rheumatoid arthritis, neutropenia and splenomegaly (Ahem M. J., et al., P. 

20 Phenotypic and genotypic analysis of mononuclear cells from patients with Felty's syndrome. 
Ann.Rheum.49, 103-108, 1990.) Morbidity and mortality in patients with LGL leukemia 
typically results from infections acquired during severe neutropenia. 

The etiology of LGL leukemia is also not yet knoAvn. There is strong evidence that 
suggests that leukemic large granular lymphocytes are antigen activated cytotoxic T 
25 lymphocytes (CTL), but the nature of the antigen and of the initial stimulus leading to antigen 
driven expansion are not known. 



LGL leukemic cells express FAS and FAS ligand, but they are not actively 
undergoing apoptosis (Perzova, R and Loughran, T. P, Jr. Constitutive e3q)ression of Fas 



ligand in large granular lymphocyte leukemia. British Jnl. Haematology, 1997). How they 
acquire resistance to apoptosis is not known. 

Within the field of the diagnosis and treatment of LGL and other autoimmune 
diseases, there is a need for better tools for diagnosis and early detection of disease, specific 
5 therapeutic targets and treatments for the disease, and more specific reagents and tools with 
which to identify the pathogenic pathways of these diseases. The present invention provides a 
novel gene and splice variants that are linked to these diseases, and which address the 
aforementioned needs and more, as will become clear to one of skill in the art upon reading 
the following disclosure. 

10 SUMMARY OF THE INVENTION 

Large granular lymphocyte leukemia (LGL) is a lymphoproliferative disorder often 
associated with autoimmune disease. In order to identify differentially expressed genes in 
LGL leukemia, microarray analysis is performed from RNA isolated from PBMC of LGL 
leukemia patients and compared with noimal healthy individual(s). By screening a human 

1 5 LGL leukemia library the full-length sequence of a human gene that showed 85% identity 
with rat sphingosine 1-phosphate receptor is obtained. Two different isoforms are also 
identified by RT-PCR, designated sphingosine 1-phosphate receptor 1, also referred to as 
Slp5-a and sphingosine 1-phosphate receptor 2, also refered to as SlP5-p. Sphingosine 1- 
phosphate receptor (sppr ) is present in brain, spleen, PBMCs, liver and kidney. The present 

20 inventors found it is over-expressed in LGL leukemia patients when compare to normal 
individuals. 

In a first embodiment, the invention provides a gene comprising sppr or a splice S 
variant, or sppr protein or modified proteins or fi-agments thereof. 

In a further embodiment, the invention provides a nucleic acid capable of hybridizing 
25 to at least a portion of said sppr gene, including splice variants. 

In a further embodiment, the invention provides methods for screening for 
autoimmune diseases, including LGL or rheumatoid arthritis, based on overexpression of 
sppr. 
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In a further embodiment, the invention provides for monoclonal antibodies to sppr 
and their use in detection, diagnosis and treatment of disease states. 

In a further embodiment, the invention provides for screening of ligands, agonists, 
and antagonists of sppr. 

5 In a further embodiment, the invention provides for inhibition or treatment of 

neurodegenerative disease. 

In a preferred embodiment the present invention provides a sphingosine 1 -phosphate 
receptor gene. The use of said gene makes it possible to produce the sphingosine 1 -phosphate 
receptor protein with ease and in large quantities, and said protein, which has sphingosine 1- 
10 phosphate receptor activity, can be used in developing therapeutic agents for various diseases. 

Throughout this document the nomenclature sppr and S1P5 are used interchangeably. 
The receptor was initially termed sppr. However, to be consistent with a new nomenclature 
system this receptor was renamed S1P5. 

DESCRIPTION OF THE FIGURES 

15 FIGURE 1A.-B illustrates a microairay of the differential expression of the selected 

EST. (EST (GenBank ID 1868427) is obtained Incyte Genomics.) Figure lA-B shows a 
microarray hybridized with the fluorescent labeled probes generated using mRNA isolated 
fix)m PBMC of LGL leukemia patient and from mRNA isolated from normal healthy 
individual. Figure 1 A illustrates a microarray showing the expression of an LGL leukemia 

20 patient cDNAs. Figure IB illustrates a microarray showing the expression of a normal 

healthy individual. Arrows show the expression of EST in both patient and normal individual 
(GeneBank Id: N47089). Intensity bar shows the increased expression starting firom left to 
right. A balanced differential expression of 3.0 is determined for this EST. 

FIGURE 2 shows Northem blot analysis performed with 10 ug of total RNA isolated 
25 from PBMC of LGL leukemia patients and normal healthy mdividuals. These results 
demonstrate over-egression of EST in the PBMCs of LGL leukemia when compared to 
normal and normal activated PBMCs of healthy individuals. 
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FIGURE 3 shows the complete nucleotide sequence, SEQ ID NO: 4, of human 
sphingosine 1-Phosphate receptor (SPPR) cDNA and amino acid sequence (SEQ ED NO: 3) 
as predicted by the nucleic acid sequence. The ftilWength (2,2 kb) nucleotide sequence of 
sppr is compiled from sequences of clones isolated from an LGL leukemia library and RT- 
5 PGR products obtained by using the gene specific primers designed using the corresponding 
sequence from chromosome 19. 

FIGURE 4 shows the alignment of the amino acid sequence of SPPR with other 
members of the sphingosine 1-phosphate receptor family. The deduced amino acid sequence 
of sppr is compared with rat edg-1 and nrg-1. There is approximately 85% identity with these 
10 genes. 

FIGURE 5 shows the nucleotide sequence and deduced amino acid sequence of splice 
variant, sphingosine 1 -phosphate receptor 1. 1 .6 kb fragment is obtained by RT-PCR using 
total RNA isolated from PBMC of an LGL leukemia patient. The fragment is then cloned and 
sequenced. 

15 FIGURE 6 shows the nucleotide sequence and deduced amino acid sequence of splice 

variant, sphingosine 1 -phosphate receptor 2. The nucleotide sequence of an alternative splice 
variant of sppr and deduced sequence. 1 .2 kb fragment is obtained from RT-PCR using total 
RNA isolated from PBMC of LGL leukemia. The fragment is then cloned and sequenced. 

FIGURE 7 shows results of sppr Northern blot analysis with different tissues. 
20 Northern blot analysis is performed \ising a multiple tissue Northern blot (Clontech). 

Northern blots contain approximately 1 ug of poly A+ per lane from twelve different human 
tissues. A 1 .5 kb fragment containing the ftill-length open reading frame for sppr is used as a 
probe. Results show sppr is expressed in mainly bmin, spleen, and peripheral blood 
leukocytes. Small amounts of sppr are also expressed in lung, placenta, liver and kidney. 

25 FIGURE 8 shows nucleotide and deduced amino acid sequence of human SIP 5 

cDNA. FulHength (2.2 kb) nucleotide sequence of SIP sis compiled from the sequences of 
clones isolated from LGL leukemia library (clone 6) and RT-PCR products. GenBank 
Accession No. AF331840. The predicted amino acids of the coding region are shown 
underneath by a single letter abbreviation. The left side of the sequence shows nucleotide 



-4- 



numbers and the right side shows amino acid numbers. Possible seven transmembrane helices 
are underlined. The putative polyadenylation sites are in bold. 

FIGURE 9 shows Alignment of the deduced amino acid sequence of SiPj with other 
members: The deduced amino acid sequence of SIP sis compared with predicted amino acid 
5 sequences of rat edg-8 and nrg-1 . There is approximately 86% identity with these genes. * - 
single, fully conserved residue, : - conservation of strong groups, . - conservation of weak 
groups, - no consensus. 

FIGURE 10 shows activation of Erk2 by SIP in HEK293 cells transiently transfected 
with SIP5. HEK 293 cells ti^fected with the HA-ERK2 plasmid (0.2 jig ) and either 

10 pcDNA SlPs (0.5 |iig) or vector alone. Vector plasmid is added to each transfection reaction 
to equalize the amount of total DNA (2.1 jig ). After serum-starvation, the cells are treated 
with l|iM SIP or IpM LPA for 5 min (BSA was added to the controls). HA-ERK2 is 
immunoprecipitated from one half of each whole cell lysate and used for measuring the 
kinase activity utilizing MBP as substrate, while HA-ERK2 immunoprecipitated from the 

1 5 other half is used for determining the amount of ERK2 protein in the immune complex. 

Figure 10 illustrates a representative autoradiogram of ^^P incorporation into MBP catalyzed 
by HA-ERK 2 immunoprecipitated from transiently transfected cells treated as indicated. 
Figure 10 further illustrates the corresponding Westem blot demonstrating the amount of HA- 
ERK2 present in each of the immxme complexes. Figure 10 fiirther illustrates a plot of ERK 

20 2 activity (fold) normalized to the amoimt of ERK2 protein (means ± SD from three 
independent experiments). 

FIGURE 1 1 shows Northern blot analysis of SIP5 mRNf A expression in PBMC of 
LGL leiikemia patients and normal healthy individuals. Northern blot is performed with 10 
\ig of total RNA isolated from PBMC of LGL leukemia patients and normal healthy 
25 individuals. LGL = LGL leukemia patients, N= Normal healthy individual NA= Normal 
healthy individuals PBMCs activated by IL2 and PHA. These results demonstrate over- 
expression of SI Ps in the PBMC of LGL leukemia when compared to normal and normal 
activated PBMC of healthy individuals. 

FIGURE 12A-B shows comparison of the predicted amino acid sequences of SI P5, 
30 SIP5 -a and SlPs-^. The predicted amino acid sequences are aligned using CLUSTAL 
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program. Figure 12A illustrates the nucleotide sequence of an alternative splice variant of 
SlPs-a and deduced amino acid sequence. A 1,6 kb fragment is obtained from RT-PCR using 
total RNA isolated from PBMC of LGL leukemia patient. This fragment is cloned and 
sequenced. Figure 12B illustrates the nucleotide sequence of an alternative splice variant of 
5 SlPs'^ and deduced sequence. A 1 .2 kb fragment is obtained from RT-PCR using total RNA 
isolated from PBMC of LGL leukemia. This fragment is cloned and sequenced. 

FIGURE 13 shows tissue distribution of iS/Pj message. Northern blot analysis is 
performed using the multiple tissue blot obtained from Clontech. The Norfhem Blot contams 
approximately 2 ^g of poly A4* per lane from twelve different human tissues and a 1 .5 kb 
10 fragment containing the full-length open reading frame of SIP 5 is used as a probe. As shown 
above, SIP5 is expressed mainly in brain, spleen, and peripheral blood leukocytes. Trace 
amounts of SIP5 are also expressed in lung, placenta, liver and kidney. (Please note: Signals 
are significantly stronger in normal tissue on poly A + RNA Northem blot compared to total 
RNA Northem blot) 

15 DETAILED DESCRIPTION OF THE INVENTION 

The abbreviations for amino acids, peptides, base sequences, nucleic acids and so 
forth as used herein in the present specification are those recommended by the International 
Union of Pure and Applied Chemistry (lUPAC) and the International Union of Biochemistry 
(lUB) and in the "Guidelines for drafting patent specifications relative to base sequences 
20 and/or amino acid sequences" edited by the Japanese Patent Office or those commonly used 
in the relevant field of art. 

Although the genes of the present invention is represented by a single-stranded DNA 
sequence, as shown under, for example, SEQ ID N0:4, the present invention also includes 
the DNA sequence complementary to such a single-stranded DNA sequence as well as a 

25 component comprising both of these. The DNA sequence representing the gene of the present 
invention shown in the above-mentioned SEQ ID NO: 4 is an example of the codon 
combination coding for the respective amino acid residues according to the amino acid 
sequence shown in SEQ ID NO:7. The gene of the present invention is not limited to the 
above-mentioned one but may, of course, have any other DNA base sequence comprising a 

30 combination of codons arbitrarily selected for the respective amino acid residues without 
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altering the above-mentioned amino acid sequence. Selection of said codons can be carried 
out by the conventional method in which the codon usage or codon choice in the host to be 
used for gene recombination is taken into consideration [Nucl, Acids Res., 9,43-74(1981)], 
and these codons can be produced, for example by chemical synthesis, etc. 

5 The gene of the present invention further includes DNA sequences coding for those 

equivalents to the above-mentioned amino acid sequence that are derived from the latter by 
deletion, addition or like modification of one or more amino acid residues or part of the 
amino acid sequence and have similar sphingosine 1-phosphate receptor activity to that of the 
sphingosine 1-phosphate receptor protein. While production, alteration (mutation) or the like 

10 of these polypeptides may occur spontaneously, they can also be produced by 

posttranslational modification. Fxirthermore, any desired gene can be produced by gene 
engineering techniques such as the site-specific mutagenesis technique in which the natural 
gene (gene of the present invention) is altered, by a chemical synthesis technique such as the 
phosphite triester method in which mutant DNAs are synthesized or by combining both 

15 procedures. By utilizing the gene of the present invention, namely by incorporating the same 
into a vector for use with a microorganism, for instance, and cultivating the transformant 
microorganism, the sphingosine 1-phosphate receptor protein can be expressed readily and in 
large quantities, and said protein can be isolated and provided. Since said protein has 
sphingosine 1-phosphate receptor activity, it is effective for various pharmacological 

20 purposes, and it is also useful, among others, in elucidating the pathogenesis, the pathologies 
or the like of various diseases. More specifically, the recombinant sphingosine 1-phosphate 
receptor protein obtained by utUizing the gene of the present invention can effectively be 
used, for example, in elucidating the mechanism of immunosuppression in living bodies, 
developing or screening out therapeutic agents for autoimmune diseases (e.g. rheumatism, 

25 SLE (systemic lupus erythematodes), LGL, etc.), searching for endogenous ligands and 
substrates to the novel protem and developing therapeutic agents therefor. 

Similarly, the gene of the present invention can effectively be used, for example, ia 
elucidating the mechanism of neurodegeneration in living bodies, developing or screening out 
therapeutic agents for neurodegenerative diseases (e.g. alzheimers, parkinson's and the like), 
30 searching for endogenous ligands and substrates to the novel protein and developing 
therapeutic agents therefor. 
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In the follovdng, the gene of the present mvention will be described m more detail. 
The gene of the present invention can be isolated by general genetic engineering techniques, 
for example, by selecting an appropriate clone from among a human fetal brain cDNA library 
(cDNA synthesized in the conventional manner from mRNA isolated and purified from total 
5 RNA obtained in turn from appropriate origin cells containing a gene coding for the 
sphingosine 1 -phosphate receptor proteui) using appropriate probes, such as for example 
those of SEQ ID 1 and SEQ ID2, purifying said clone, and determining the base sequence 
thereof In the above procedure, the origin cells may be any animal cells or tissues where the 
occxirrence of sphingosine 1 -phosphate receptor protein is known (see for example, the 
10 experiment producing the results shown in FIG 6), or soluble fractions of cultured cells 
derived therefrom. This can be isolated and purified for the culture supematant by various 
chromatographic processes. 

In the practice of the present invention, it is also possible to use a part of the DNA 
fragment sequenced in the above manner as a probe, label this using a random prime DNA 
15 labeling kit (available from Takara Shu2o, Amersham, etc.) in accordance with the random 
prime DNA labeling method (Feinberg, A. P., et al., Anal. Biochem., 137, 266-267 (1984)), 
for instance, and use the thus-obtained labeled probe in screening out the desired sphingosine 
1-phosphate receptor protein gene. 

Using the above-mentioned labeled probe, for mstance, the desired DNA can be 
20 screened out by the plaque hybridization technique developed by Benton and Davis (Benton, 
W. and Davis, R., Science, 196, 383-394 (1977)). 

The gene of the present invention as obtained in the above manner can be cloned in 
various plasmids in the conventional manner. For instance, after cleavage with an appropriate 
restriction enzyme and purification, the gene of the present invention can be inserted into a 

25 cloning vector (e.g. plasmid) cleaved with the same restriction enzyme and purified, at the 
cleavage site thereof, whereby a recombinant plasmid can be obtained. By introducing said 
recombinant into an appropriate host (e.g. Escherichia coli) for transformation, a restriction 
enzyme map of the clone containing said gene can be drawn using the transfonnant by a 
conventional known method, for example the method as described in Sambrook, J. Fritsch, 

30 E.F., and Maniatis. Molecular cloning. A laboratory Manual 2nd edition. Cold Spring Harbor 
laboratory Press. Cold Spring Harbor, NY. After digestion of the above clone with an 
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appropriate restriction enzyme, the base sequence of said clone can be determined by the 
above-mentioned dideoxy method or the Maxam-Gilbert method, for instance. The base 
sequence determination mentioned above may also be readily performed using a 
commercially available kit or the like. 

5 The thus-determined DNA base sequence of the sphingosine 1-phosphate receptor 

protein gene of the present invention and the corresponding amino acid sequence encoded 
thereby are as shown in the sequence listmg under SEQ ID NO: 3 and SEQ ID N0:4. 

Using the above-mentioned gene (DNA) of the present mvention, the recombinant 
sphingosine 1-phosphate receptor protein can be obtained by various known gene 

10 recombination techniques [cf. for example Science, 224, 1431 (1984); Biochem. Biophys. 
Res. Comm., 130, 692 (1985); Proc. Natl. Acad. Sci. USA, 80, 5990 (1983)]. Said 
sphingosine 1-phosphate receptor protein is produced, in more detail, by constructing a 
recombinant DNA allowing expression of the gene of the present invention in host cells, 
introducing this into host cells for transformation thereof, and cultivating the transformant 

1 5 strain. The host cells may be either eukaiyotic or prokaryotic. As an expression vector for use 
with vertebrate cells, it is possible to use one containing a promoter generally located 
upstream of the gene to be expressed, an RNA splicing site, a polyadenylation site and a 
transcnption termination sequence and so on. This may further have a replication origin, as 
necessary. Yeasts are often and generally used as eukaryotic microorganisms and, among 

20 them, yeasts belonging to the genus Saccharomyces are advantageously used. Usable as 

expression vectors for use with said yeasts and other eukaryotic microorganisms are pAM82 
(A. Miyanohara et al., Proc, Natl. Acad. Sci. USA, 80, 1-5 (1983)) containing a promoter for 
the acid phosphatase gene, and like vectors. Escherichia coli and Bacilus subtilis are 
generally and very often used as prokaryotic host cells. When these are used as hosts in the 

25 practice of the present invention, an expression plasmid is preferably used which is derived, 
for instance, from a plasmid vector capable of replication in said host microorganisms and 
provided with a promoter, the SD (Shine and Dalgamo) base sequence and further an 
initiation codon (e.g. ATG) necessary for the initiation of protein synthesis, upstream from 
the gene of the present invention so that said gene can be expressed. As the host Escherichia 

30 coli mentioned above, the strain Escherichia coli K12 and the like are often used and, as the 
vector, pBR322 is generally and often used. However, the host and vector are not limited 
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thereto, but other various known microbial strains and vectors can also be used. As regards 
the promoter, the tryptophan (trp) promoter, 1 pp promoter, lac promoter and P.sub.L 
promoter, for instance, can be used. 

The thus-obtained desired recombinant DNA can be introduced into host cells for 
S transformation thereof by various conventional methods. The transformant obtained can be 
cultivated m tiie conventional manner, leading to production and accumulation of the desired 
sphingosine 1-phosphate receptor protein encoded by the gene of the present invention. The 
mediirai to be used in said cultivation can adequately be selected, according to the host cells 
employed, from among various media in common use. When Escherichia coli or like cells 

10 are used as host cells, for instance, transformant cultivation can be conducted xising LB 
medium, E medium, M9 medium, M63 medium or the like. To these media, there may be 
added, as necessary, generally known various carbon sources, nitrogen sources, inorganic 
salts, vitamins, nature-derived extracts, physiologically active substances, etc. The above- 
mentioned transformant cultivation can be carried out under conditions suited for the growth 

15 of the host cells. In the case of Escherichia coli, such conditions can be employed, for 

instance, as a pH of about 5 to 8, preferably 7 or thereabout, and a temperature of about 20 to 
43. degree. C, preferably 37.degree, C. or thereabout. In the above manner, the transformant 
cells produce and accumiilate intracellularly or secrete extracellularly the desired 
recombinant FK506 binding protein. 

20 Said desired protein can be isolated and purified by various separation techniques 

utilizing its physical, chemical and other properties [cf. for example "Seikagaku 
(Biochemistry) Data Book IF', pages 1175-1259, 1st edition, 1st printing, published Jun. 23, 
1980 by Kabushiki Kaisha Tokyo Kagaku Dojin; Biochemistry, vol. 25, No. 25, 8274-8277 
(1986); Eur. J. Biochem., 163, 313-321 (1987)]. As specific examples of said techniques, 

25 there may be mentioned conventional reconstitution treatment, treatment with a protein 

precipitating agent (salting out), centrifiigation, osmotic pressure shock treatment, ultrasonic 
disruption, ultrafiltration, various liquid chromatographic processes such as molecular sieve 
chronoiatography (gel filtration), adsorption chromatography, ion exchange chromatography, 
afiSnity chromatography and high performance liquid chromatography (HPLC), dialysis, and 

30 combinations of these. In the above maimer, the desired recombinant protein can be produced 
on an industrial scale with ease and with high efGciency. 
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In order to provide diagnostics for LGL leukemia, and provide therapeutic targets for 
drugs directed to mitigate the pathogenesis of LGL leukemia, microarray analysis is 
perforaied to identify dijfferentially expressed genes. A large number of genes are identified 
that are differentially expressed in LGL leukemia compared to normal controls. One of the 
5 ESTs of approximately 300 base pairs is fiilly characterized herein. Initial Blast analysis 
shows 100 % homology with Homo-sapiens full-length uxsert cDNA clone YY 85D04 
(gb/AF 088014). No open reading frame within the full-length insert cDNA. Therefore, in 
order get the complete sequence of the gene, the LGL leukemia library is screened and also 
RT-PCR is performed using the total RNA isolated from different LGL leukemia patients. 15 

10 positive clones are selected from library screening. All of them give partial sequences with 
the longest one being approxunately 340 base pairs shorter (clone 6). BLAST search with 
htgs, shows that clone 6 shows 100% homology with genomic sequence present m the 
chromosome 19. Primers are designed based on the genomic sequence information to obtain 
full-length sequence of the gene. By using these primers in the PGR with genomic DNA and 

15 RT-PCR with total RNA, the full-length gene, SEQ ID:4 is obtained. This gene belongs to 
the G-protein-coupled receptor super-femily of integral membrane proteins. BLAST analysis 
of the complete gene reveals 85% homology with rat sphingosine 1 -phosphate receptor edg-8 
andnrg-l(Im, D., et al.. Characterization of a Novel sphingosine 1-Phosphate receptor, Edg- 
8. J.Biol.Chem.275. 1428 1-14286 (2000); Glickman, M., et al., Molecular cloning, tissue- 

20 specific expression and chromosomal localization of a novel nerve growth factor regulated G- 
protem-coupled receptor, nrg-l . Mol. Cell. Neurosci. 14, 141-152 (1999)), shown in FIG.4. 
It is interesting to note that this gene is present mainly in brain, spleen and PBMCs (FIG.7), 
and it is over expressed in PBMC of LGL leukemia patients and is be involved in LGL 
leukemia cell survival or proliferation. 

25 Material and Methods: 

Isolation of Peripheral blood mononuclear cells (PBMC and RNA). PBMC are 
isolated from normal healthy individuals and from LGL leukemia patients. Trizole is 
obtained from GTBCO-BRL. EST (GenBank ID 1868427) is obtained Incyte Genomics. 
Oligotex mRNA mini-kit, plasmid isolation kits, gel extraction kits, and PCR reagents are 
30 purchased from Qiagen; RNA loading dye is from Sigma Chemical Co. The Prime-a-Gene 
labeling kit is from Promega Corp. (Madison, WI). Deoxycytidme 5'triphosphate dCTP a- 
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32P (3,000 Ci/ mmol) is ifrom Dupont NEN (Boston, MA). Nytran membrane is obtained 
from Schleicher& Schuell, Inc., 10 optical Avenue, Keene, N.H. Nick translation columns are 
obtained from Pharmacia Chemical Co. The Topo-TA cloning kit is from Invitrigen. 

PBMC are isolated from whole blood using FicoU-Hypaque density gradient 
5 centrifugation. The PBMC cells are suspended in Trizole reagent (GIBCO-BRL, Rockville, 
MD) and total RNA is immediately isolated according to the Oligotex mRNA mini-kit 
manufacturer's instructions and stored at — ^70 ^C. Poly A+ RNA is isolated from total RNA 
by using Oliogo-Tex mini mRNA kit according to the manufacturer's recommendations. 
PBMCs are cultured in vitro and activated by Interleukin 2 and phytohemagglutinin (PHA) 
10 for 2 to 3 days. In a preferred embodiment, PBMC is cultured in vitro and activated by PHA, 
(Sigma Chemical Co. St. Louis, MO) (l^g/ml, 2 days) and Interleukin-2 (IL-2) (100 U/ml, 10 
days), Next, total RNA is isolated as described above. 

Microarray probing and analysis is done by Incyte Genomics, (St. Louis, Missouri). 
Approximately 1 ug of Poly (A)* RNA isolated from PBMCs of LGL leukemia and healthy 

15 individual is reverse transcribed to generate Cys3 and Cys 5 fluorescently labeled cDNA 
' probes. In a preferred embodiment, more than 90% of PBMC from the LGL leukemia patient 
are leukemic LGL as indicated by CD 8"^. staining. cDNA probes are competitively hybridized 
to a human UniGEM V cDNA microarray containing approximately 7075 immobilized 
cDNA fragments (4107 known genes and 2968 ESTs). Scanning and quantitation is 

20 performed by Incyte Genomics and balanced differential differentiation is given for all the 
genes. The balanced differential expression is calculated using the ratio between the PI signal 
(intensity reading for probe 1) and the balanced P2 signal (intensity reading for probe 2 
adjusted using the balanced coefficient). A balanced differential expression of 2.0 is 
considered indicative of up-regulation of a given gene. 

25 Verification of clones: GEM cDNA clones are purchased from Incyte Genomics as 

individual bacterial stabs and streaked on LB /agar plates containing appropriate antibiotic(s). 
Individual colonies are picked and grown in LB medium. Plasmid DNA is isolated and 
sequenced in order to verify the correct identity of each clone. 

Northern Blot analysis: Northem Blotting is done as described previously (Sambrook 
30 et al, 1998). Essentially, 10 ug of total RNA from each sample is denatured at 65 °C in a 
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RNA loading buffer, electrophoresed in 1% agarose containing 2.2 M fonnaldehyde gel, and 
blotted onto a Nytran membrane. (Nytran membrane obtained &om Schleicher & Schuell, 
Inc, Keene,N.H). The RNA is fixed to the membrane by UV cross-linking. cDNA is labeled 
with pP] (Prime-a-Gene labeling kit firom Promega Corp. Madison, WI, deoxyc)rtidine 
5'triphosphate ( dCTP a-^^P, 3,000 Ci/ mmol, Dupont NEN, Boston, MA) and purified by 
Nick columns (Amersham Pharmacia Biotech AB, Piscataway, NJ). Hybridization and 
washings of the blots are performed as described by Engler-Blum, G., Meier, M ., Frank, J., 
and Muller, GA. Reduction of background in problems in non-radioactive Northern blot 
analysis enables higher sensitivity than 32P-based hybridizations. Anal. Biochem. 210, 235- 
244 (1993). 

Library construction and screening. cDNA is synthesized firom poly(A)"*" RNA 
isolated firom pooled PBMCs of multiple LGL leukemia patients using oligo dT primer. The 
cDNA is unidirectionally inserted the EcoRI / Xhol sites of Lambda ZAPII 
(Stratagene).cDNA library is screened using EST according to standard protocol (Sambrook 
et aL, 1 989). In a preferred embodiment, DNA libraries are plated at a density of 50,000 
plaque-forming imits per 150mm plate. Following incubation for 8 h at 37 ^C, the plated 
phage are overlaid with nitrocellulose filters. After 1 min the filters are removed and the 
membranes are crossed linked by Autocross linker. A pP] labeled cDNA firagment derived 
firom an EST ( GenBank accession No. N 47089) of interest is used to probe the filters. 
Hybridizations, washings, exposure of the membranes to films and then picking up the 
colony of interest are performed as outlined in the standard methodology (Sambrook et aL, 
1989). Secondary and tertiary screenings were also performed as outlined in standard 
methodology (Sambrook et aL, 1989). After isolation of pure phage containing the gene of 
interest, mini- preparations or macro-preparation are made to isolate plasmid cDNA 
containing the gene of interest. 

RT-PCR: To obtain the fiill-length sequence, 5' and 3' primers are designed based on 
the sequence information available in GenBank: 

5' GCGCGGCCCAT GGAGTC 3' (SEQ.ID# 1) 

is used as forward primer and 

y CTTTTCTGTGTTCCCAAGC AGAAC GTCAAT 3' (SEQ.ID#2) 
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is used as reverse primer. Total RNA from PBMC isolated from LGL leukemia patients and 
normal healthy individuals is used as a template for reverse transcriptase for making cDNA 
using either oligo(dT) primer or random hexamer primers. The PGR reaction mixture is 
heated to 95 for 2 min and then cycled 40 times at 95 for 30 sec, 60^C for 45 sec, and 
72 for 1 .5 min. Finally, the reaction mixture is heated at 72 °C for 7 min and stored at 4 
^C. The reaction product is electrophoresed in 1% agarose gels. For direct PGR, all the 
conditions are the same as above except that genomic DNA, isolated from PBMC, is used as 
a DNA template. PGR products are analyzed in 1 % agarose gel and the bands are excised and 
cloned into a TOPO-TA cloning vector (Invitrogen) and sequenced. The insert is subcloned 
into EcoRl sites of mammalian expression vector pcDNA3. 1 to produce pcDNASSlPs, 

Cell culture and transfection HEK293 cells are grown in Dulbecco's modified 
eagle's medium supplemented with 10% fetal bovme serum. The cells are transiently 
transfected with a plasmid encoding HA-tagged Erk2 (HA-Erk2) and either pcDNA 3 SIP5 or 
pcDNA 3.1. Transfection is achieved by incubating the cells in 60 mm plates with 
plasmid/Lipofectamine complexes (2.1 |ig total DNA/12 \xl Lipofectamine) in serum-free 
medium for 5 hours. The DNA complexes are removed from the medium and the cells are 
starved overnight in serum-free medium and then used for experimentation. 

Erk2 Kinase Assay. The serum-starved transiently transfected HEK293 cells are 
treated for 5 min preferably with either l|iM sphingosine-1 -phosphate (SIP) or with l)xM 
lysophosphatidic acid (LPA). The cells are lysed in buffer containing 50 mM Tris-HGl pH 
7.5, 150 mM NaGl, 1 mM EDTA, 1 mM EGTA, 1 mM DTT, 1% Triton X-100, 25 mM NaF, 
5 mM sodium pyrophosphate, 20 mM p-nitrophenyl phosphate, 2 \ig/mL leupeptin, and 100 
|ig/mL phenyhnethylsulfonyl fluoride. HA-Erk2 is unmunoprecipitated with the monoclonal 
antibody HA.11 (Convance, Richmond, CA). Half of the immunoprecipitate is used to 
determine Erk 2 activity and the other half is used for measuring Erk2 protein e3q)ression. For 
the kinase assay, immune complexes are incubated for 10 min at 30 in 40 jil of buffer 
containing 20 mM Hepes, pH 7.5, 10 mM MgCb, 1 mM dithiothreitol, 10 mM /?-nitrophenyl 
phosphate, 40 ^M ATP and 0.375 mg / mL myelin basic protein and 10 |LiCi of [y-^^P] ATP 
(3000 Ci / mmol). The reaction is terminated with SDS-containing gel-loading buffer and the 
reaction mixtures are analyzed on 1 1% SDS-polyacrylamide gels. The gels are processed by 
autoradiography. The bands on the gels are quantitated with a Phosphorlmager. Erk2 protein 
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in the immunoprecipitate is determined by immunoblotting with a polyclonal antibody to 
Erk2. 

Examples 

Referring now to FIG. 1, approximately 50 genes are up-regulated in LGL leukemia, 
with balanced differential expression of between about 7.8 and about 2.0. In addition, one 
EST is particularly noteworthy that is up-regulated m LGL leukemia with balanced 
differential expression of 3.0 (GenBank Accession number N47089). A clone containing this 
EST is sequenced. The total length of the EST is approximately 300 base pairs. A search 
using Blast shows 100% homology with another EST (GenBank Accession No. AF088014) 
named as homo sapiens full length insert cDNA clone YY85D04. No other infonnation 
regarding this EST is found in the literature. No open reading frame is found within this 
sequence. Northem blot analysis confirms that a gene related to EST (GenBank ID No. 
N47098) is upregulated in majority of LGL leukemia patients. 

Using the microarray screening method, one LGL leukemia patient is compared with 
one normal healthy individual. To show the same pattem in a larger sample of patents, 
Northem blot analysis is performed. Total RNAs, isolated fiom the PBMC of normal healthy 
individuals and LGL leukemia patients, are used in Northem blots. Initially, a 300 base pair 
cDNA fragment is used as a probe in initial experiments. Up-regulation of EST is observed in 
all the LGL leukemia patients when compared to the normal healthy individuals. This 
confirms the microarray results regarding EST expression. The probe hybridizes to a 2.2 kb 
transcript in the Northern Blots. (Fig-2). 

An LGL leukemia library is constructed from the noiRNA isolated from the pooled 
PBMCs of the seven LGL leukemia patients. This library is screened to obtain full-length 
sequence of the gene. Approximately 15 positive clones are selected and the larger clones are 
sequenced. The largest clone is 1500 bp in length. Analysis using Blast indicates that this 
gene has 85% homology with Rat edgS (Im et al, 2000). All of the clones are missing 5'end 
of the gene. Blast search with htgs show 99% homology with the sequence present in 
chromosome 19. Based on the sequence information, primers are designed fit)m the 5' end 
and from 3'endofthe open readmg frame of the gene. Three different products (1.5, 1.6, and 
1 .2 bp m length) are obtained using RT-PCR. These products are subjected to gel 
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electrophoresis and bands are excised, cloned into TOPO-TA cloning vectors and sequenced. 
■ The largest PGR product contains the entire open reading fratae (FIG.3). The deduced amino 
acid sequence shows 85% homology with complete sequence of rat sphingosine 1-phosphate 
receptor edg-8 and nrg-1. (FIG. 4). Shorter bands are also identified. The shorter bands are 
5 excised, cloned, and sequenced. These clones are splice variants of sphingosine 1 - phosphate 
receptor with deletions. They are herein termed "sphingosine 1-phosphate receptor -1" and 
"sphingosine 1-phosphate receptor-2" (FIG.5 & 6). 

Expression of sphmgosme 1-phosphate receptor is examined in different normal 
tissues by Northern blot analysis. It is found that sppr is expressed in several tissues such as 
1 0 brain, spleen and PBMCs. (FIG.7). Only trace amounts are detected in Jukat and CEM cell 
lines (data not shown). 

To obtain a full-length sequence of the gene, an LGL leukemic cDNA library is 
constructed and screened usmg the EST probe. Approximately 15 positive clones are selected 
and larger clones are sequenced. The BLAST search of the largest clone (1500 bp) mdicates 
15 that this gene has strong homology with Rat edg-8/Nrg-l. However, all of the clones are 
missing the 5'end of the gene when compared to the rat gene. A BLAST search with the 
human genome shows 99% homology with a sequence present on chromosomel9. Based on 
this sequence mfonnation, primers are designed fiom the 5' and 3'ends of the open reading 
fiame of the gene. 

Three different RT-PCR products (1.5, 1.6, and 1.2 bp) are obtained. These products 
are subjected to gel electrophoresis. The resulting bands are excised and cloned into TOPO- 
TA cloning vectors and then sequenced. The largest PGR product contams a complete open 
reading fiame. The nucleotide sequence and the deduced amino acids are shown in Figure 8. 
The gene is designated as SIP5 (see below). The nucleotide sequence shows very strong 
homology witii G-protein coupled receptors, eq)ecially witii the endothelial differentiation 
genes (EDGs). When the deduced amino acid sequence of the fiiU-length sequence is aligned 
with other members of the femily using the GLUSTALW (multi sequence alignment) 
program, it is approximately 26 to 44% identical and 58 to 72% similar with EDGs at amino 
acid level (Tablel). In addition, it shows 86% identity and 96% sunilarity with rat edg-8 or 
rat nrg-1 at amino acid level. (Figure 9, Table I), Transient transfection of HEK293 cells with 
this gene results m activation of Erk2 activity in response to sphingosme- 1-phosphate but not 
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LPA, confirming that it is a sphingosine-1 -phosphate receptor (Figure 10). Therefore, this 
gene is named SIP5. 



Samples firom 30 LGL leukemia patients are tested for the presence of SlPs transcript 
by Northern blot analysis using full-length gene as a probe. Constitutive expression of SlPs 
transcripts is found in 24 samples (Fig. 1 1). In comparison iS'iP5 transcripts are expressed at 
only trace levels in normal PBMC (N=12). After activation of normal PBMC the expression 
of SI Ps is reduced to undetectable levels (Fig. 12). Additionally, expression of two smaller 
bands is detected in samples firom leukemic LGL by RT-PCR. Human SIP5 transcripts are 
expressed mainly in normal brain, spleen, and PBMC and in trace amounts m lung, kidney 
and liver (Fig. 13). Whereas expression of Edg-8 is observed only in brain and spleen of rat 
when Northern Blots are probed. Several cell lines are examined for the presence of S IP 5 
transcript Trace amounts of SIP5 transcripts are identified in CEM and Jurkat cells (data not 
shown). All other cell lines tested are negative for 57^5 transcript including MT2 (HTLV-I 
infected cell line) and MO-T (HTLV-II infected cell line), Moe7 (megakaryoblastic leukemic 
cell line) and U293 (human embryonic kidney cells). 

Table 1. Identity and similarity between SlPs and other members of the Edgs. 

The deduced amino acid sequence of SlPs is aligned with the amino acid sequences 
of various members of Edgs. using the CLUSTALW program. Except for Edg 8 and nrg-1, 
all other sequences are &om human. All the sequence information is obtained fi"om GenBank. 



Name of the gene 


% Identity 


% Similari^ 


hSiP5 


100 


100 




87 


96 


rNrg-1 


86 


98 


hEdg-r 


44 


72 


hEdg-5**- 


41 


66 


hEdg.3" 


40 


70 


hEdg-6* 


39 


67 


h Edg-2'^ 


35 


67 


hEdg-4'^ 


30 


60 


hEdg-T' 


26 


58 



^ Sphingosine I- phosphate leoqiton 
it »Lysq)bospIiad(KcBcidrecq)ton 
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Leukemic LGL are resistant to Fas-induced apoptosis, in spite of over-expression of 
Fas and Fas-ligand (FasL) implying that the accumulation of circulating LGL can be due to 
dysregulation of apoptosis. The accumulation of circulating LGL in leukemic patients can 
also be due to clonal proliferation of LGL. In order to understand the molecular mechanisms 
involved in pathogenesis of LGL leukemia, microarray techniques are used to identify 
differentially expressed genes. Approximately 50 genes are identified that are up-regulated 
and 10 genes that are down regulated. Several ESTs are also identified which show 
differential expression. As a systematic study, one of the ESTs that is up-regulated in LGL 
Leukemia is characterized. The full-length gene is obtained by screening the LGL leukemia 
library and performing RT-PCR, which is 85% identical to the rat Sphingosine — 1 Phosphate 
receptor. This gene belongs to G-protein coupled receptor super family and can act as a 
sphingosine-l-phosphate receptor. Several splice variants in LGL leukemia patients are also 
identified, and are named Sphingosine 1 -phosphate receptor 1 and Sphingosine 1 -Phosphate 
receptor 2. The deduced andno acid sequence of Sphingosine 1-Phosphate receptor with rat 
edgS ornrg shows 85% homology. It has seven transmembrane domains, which is a 
characteristic of GTP-coupled receptors. Thus, the Sphingosine-1 Phosphate is involved in the 
signal transduction &om the sphingosine 1 -Phosphate in human. 

Although the gene has lot of homology with other members of edg family, it is 
preferably named sphingosine-1 -phosphate receptor {SIP 5) because it is mainly present in 
20 lymphocytes, brain and spleen, but not in endothelial cells. 

Lysophosphatidic acid (LP A) and sphingosine 1 -phosphate (SIP) mediate T cell 
function. Both LP A and SIP signaling pathways are implicated in cell proliferation, 
suppression of apoptosis, enhancement of cellular survival and T-lymphoma cell invasion. 
Although it has been suggested that SIP can act as an intracellular mediator, it has also been 

25 suggested that SIP acts as an extracellular ligand for cell surface receptors. Indeed several 
such receptors have been identified in a wide variety of tissues. For example, receptors Edg- 
1, -3, -5, -6 and -8, are specific for SIP, whereas Edg-2, -4, and -7 are LPA specific. In 
normal lymphocytes, there is differential constitutive expression of receptors for LPA and 
SIP. CD4^ cells express predominantly Edg-4, while CD8^ cells appeared to lack receptors 

30 for LPA and SIP as only traces of Edg-2 and Edg-5 are detected. Human T cell tumors 
express many Edgs for both LPA and SIP. 
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Rat edg-8 / nrg-l is shown to be a sphingosine-l-phosphate receptor based on specific 
binding of radio-labeled SIP to cell membranes, inhibition of forskolin-mduced cAMP 
accumulation, increased OTP binding ability and calcium mobilization studies. Even though 
these properties are adequate to classify edg-S/nrg-l as a sphingosine-l-phosphate receptor, 

5 it seems surprising that this gene is different from other members of the human sphingosine- 
1- phosphate receptor family. For example, activation of EDG-1, -3, - 5 and --6 by SIP leads 
to activation of Erkl / 2 and induction of cell proliferation. In contrast SIP inhibited serum- 
induces activation of Erkl / 2 and also mhibits the cell proliferation in CHO cells expressing 
EDG-8. The reasons for these differences are not known and might be due to species 

10 variation. As shown herein, SIP activates Erk2 in transiently tranfected HEK293 cells while 
lysophosphatidic acid does not, suggesting that SI Ps is a sphingosine-l-phosphate receptor 
and participates in sphingosine 1 -phosphate mediated signal transduction. A computational 
model of the Edg-1 receptor predicts that Glu is essential for interaction with SIP [21]. 
The SIP receptors Edg-1, -3, -5 and -8 as well as SlPs share such an anionic residue. 

15 Leukemic LGL are antigen driven CTL that survive in vivo, at least in part, because of 

defective apoptosis. For example, leukemic LGL express both Fas and Fas-ligand, but are 
resistant to Fas mediated death. It is noteworthy that SIP5 gene transcripts are down 
regulated after activation of normal T cells. Leukemic cells are activated T cells. Based upon 
the results disclosed herein, constitutive expression of SiPj transcripts represents 

20 dysregulated expression. This dysregulated expression of SlPs may participate in protection 
of leukemic LGL from apoptosis. 

Note: The ftill-length sequence was deposited in GenBank (Accession No.AF33 1 840) on 
December 22, 2000. 

Throughout this application, various publications, including United States patents, 
25 have been referred to. The disclosures of these publications and patents in their Mitireties are 
hereby incorporated by reference into this application to more fully describe the state of the 
art to which this invention pertains. 

While the invention has been described in terms of various preferred embodiments, 
those skilled in the art will recognize that various modifications, substitutions, omissions, and 
30 changes may be made without departing from the spirit of the present invention. Accordingly, 
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it is intended that the scope of the present invention be limited solely by the scope of the 
following claims. 
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What is claimed is: 



1 . An isolated purified sphingosine 1 -phosphate receptor protein gene having fhe 
sequence of SEQ ID N0:4, SEQ ID N0:9, or SEQ ID NO: 13. 

2. A purified nucleic acid comprising a sequence of more that 100 base pairs that is, or is 
5 complementary to, at least 100 bases of SEQ ID N0:4, SEQ ID N0:9, or SEQ IID N0:13. 

3. A genetic construct comprising a sequence of more that 100 base pairs that is, or is 
complementary to, at least 100 bases of SEQ ID N0:4, SEQ ID N0:9, or SEQ ID N0:13. 

4. A purified nucleic acid that hybridizes to a portion of the nucleic acid of SEQ IID 
N0:4, SEQ ID N0:9, or SEQ ID NO: 13. 

10 5. A purified protein having greater that 8S% homology to the amino acid sequence of 
SEQ.ID N0:3, SEQ.ID N0:7, SEQ.ID N0:8, SEQ.ID NO:10, SEQ.ID N0:1 1, SEQ.ID 
N0:12, or SEQ .ID N0:14, or a fragment or an analog thereof. 

6. A method for screening for an autoimmune disease, comprising: providing a sample 
from a patient suspected of having an autoimmune disease; and screening said sample for 

1 5 over-expression of sppr. 

7. The method of claim 6, in which said sample comprises RNA and said screening 
comprises measuring sppr mRNA. 

8. The method of claim 6 in which said sample comprises protein and said screening 
comprises measuring sppr protein. 

20 9. The method of claim 6, in which said disease is LGL or rheumatoid arthritis. 

10. A purified protein that is sppr or a fi:agment or derivative thereof. 

11. A method of producing a recombinant sppr protein which comprises introducing a 
base sequence containing the sppr protein gene of claim 1 into a host to thereby transform said 
host, cultivating the thus-obtained transformant, and recovering the recombinant sppr protein 

25 thus produced. 

12. An expression vector which contains a gene as claimed in claun 1 . 
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13. A host cell which is transfonned with a vector as clauned in claim 12. 



14. The purified nucleic acid of claim 2 wherein said nucleic acid is ribonucleic, acid. 

1 5. The purified nucleic acid of claim 14 wherein said ribonucleic acid is mRNA. 

16. An antisense nucleic acid molecule complementary to the mRNA of claim 15, or a 
firagment thereof 

17. An expression vector comprising the nucleic acid molecule of claim 2. 

1 8. A method for screening for a neurodegenerative disease, comprising: providing a 
sample fi-om a patient suspected of having a neurodegenerative disease; and screening said 
sample for over-expression of sppr. 

19. The method of claim 18, in which said sample comprises RNA and said screening 
comprises measuring sppr mKNf A. 

20. The method of claim 19 in which said sample comprises protein and said screening 
comprises measuring sppr protein. 
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FIG. 3 

Human sphingosine 1-Phosphate receptor 

LOCUS tmpseq_l 2336 bp 4-DEC-2000 

SOURCE PBMCs (LGL) 

ORGANISM Human 

Unclassified. 
FEATURES Location/Qualifiers 
source 1..2336 
CDS 10.-1206,10. .1206 

/note="predicted coding region" 
/translation-" 

MESGLLRPAPVSEVIVLHYNYTGBCLRGARYQPGAGLRADAWCLAVCAFIVLENLAVLLV SEQ. ID #3 

LGRHPRFHAPMFLLLGSLTLSDLLAGAAYAANILLSGPLTLKLSPALWFAREGGVFVALT 

ASVLSLLAIALERSLTMAFIRGPAPVSSRGRTLAMAAAAWGVSLLLGLLPALGWNCLGRLD 

ACSTVLPLYAKAYVLFCVLAFVGILAAICALYARIYCQVRANARRLPARPGTAGTTSTRA 

RRKPRSLALLRTLSWLLAFVACWGPLFLLLLLDVACPARTCPVLLQADPFLGLAMANSL 

LNPI I YTLTNRDLRHALLRLVCCGRHSCGRDPSGSQQSASAAEASGGLRRCLPPGLDGS F 

SGSERSSPQRDGLDTSGSTGSPGAPTAARTLVSEPAAD" 

BASE COUNT 461 a 679 c 701 g 495 t 

ORIGIN 

1 gcgcggccca tggagtcggg gctgctgcgg ccggcgccgg tgagcgaggt catcgtcctg 
61 cattacaact acaccggcaa gctccgcggt gcgcgctacc agccgggtgc cggcctgcgc 
121 gccgacgccg tggtgtgcct ggcggtgtgc gccttcatcg tgctagagaa tctagccgtg 
181 ttgttggtgc tcggacgcca cccgcgcttc cacgctccca tgttcctgct cctgggcagc 
241 ctcacgttgt cggatctgct ggcaggcgcc gcctacgccg ccaacatcct actgtcgggg 
301 ccgctcacgc tgaaactgtc ccccgcgctc tggttcgcac gggagggagg cgtcttcgtg 
361 gcactcactg cgtccgtgct gagcctcctg gccatcgcgc tggagcgcag cctcaccatg 
421 gcgcgcaggg ggcccgcgcc cgtctccagt cgggggcgca cgctggcgat ggcagccgcg 
481 gcctggggcg tgtcgctgct cctcgggctc ctgccagcgc tgggctggaa ttgcctgggt 
541 cgcctggacg cttgctccac tgtcttgccg ctctacgcca aggcctacgt gctcttctgc 
601 gtgctcgcct tcgtgggcat cctggccgcg atctgtgcac tctacgcgcg catctactgc 
661 caggtacgcg ccaacgcgcg gcgcctgccg gcacggcccg ggactgcggg gaccacctcg 
721 acccgggcgc gtcgcaagcc gcgctcgctg gccttgctgc gcacgctcag cgtggtgctc 
781 ctggcctttg tggcatgttg gggccccctc ttcctgctgc tgttgctcga cgtggcgtgc 
841 ccggcgcgca cctgtcctgt actcctgcag gccgatccct tcctgggact ggccatggcc 
901 aactcacttc tgaaccccat catctacacg ctcaccaacc gcgacctgcg ccacgcgctc 
961 ctgcgcctgg tctgctgcgg acgccactcc tgcggcagag acccgagtgg ctcccagcag 
1021 tcggcgagcg cggctgaggc ttccgggggc ctgcgccgct gcctgccccc gggccttgat 
1081 gggagcttca gcggctcgga gcgctcatcg ccccagcgcg acgggctgga caccagcggc 
1141 tccacaggca gccccggtgc acccacagcc gcccggactc tggtatcaga accggctgca 
1201 gactgacacc ctcggcccac gactgtcttc ccaagtttta cagacttgtt ctttttacat 
1261 aaaggaattt gtaggaaatg cagccaaagg tgcagtcgga aaagatgcag gggaaatgta 
1321 tttatgcagc gacaccccac aatgtgaaca aacagacaaa aaatctgtgc cctcgtggaa 
1381 ttgacgttct gcttgggaac acagaaaaga actcggtgat gaaataatgg agatgattcc 
1441 agtgacaaac gacagagatg gtgatggtgg tcagggaaga cctctctgca gaggtagtga 
1501 cttgtgatgt gagctgagac ctctgtcctg ggaagaccaa aagaaaagca tttcaggatg 
1561 agggaatggc atgcgcaaag gccctgaggc tgaaatgtgc ccatgtgttc taagaaatgc 
1621 agcgatgctg gtgtgcctgg agcagggacg gagggggaga atgggaggag acaaggagct 
1681 gaaggagtag ttcccgaagg accttgtggg tgatatagag gacttcgctt ttgctctgag 
1741 tgaggtggga gccatagaag cttctaagca gaagagggac ttgccctaat tcaggtgatc 
1801 acaggtgtct tgtggcctcc atgggaggtt gaaaaccaca gaaggtgaag gggggctgca 
1861 ctgagccaca ggaacaatga tggagattcc agctaagccc agaccccgtg gattctagat 
1921 agattttaga ggcagcagac agaattactg aggaattgag tgtaagagtg gaataaagtt 
1981 atcaaggaca atgccaaggg tggggcaccc ccaaatttga ctttgggaga ctcagccaaa 
2041 tcctatctgg taataaaatt tcttttttat ttttcttttc tttctttctt tctttctttc 
2101 tttttttttt tttgagttgg gatcttgtgc tctgtcaccc aggctggagt gcaatgggca 
2161 caattatagc tcactgcagc ctggaactcc tgggatcaag cctggagttc ctgcttcagc 
2221 ctccctagta gctgggacta caggcatgca ccaccatgcc cagttaataa aatttcttca 
2281 aatgcaaaaa aaaaaaaaaa aaaaaactcg agggggggcc cggtacccaa ttcgcc SEQ. ID #4 
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FIG. 4 

Alignment of deduced Amino acid sequence with Nrg-1 and Edg-8 (rat 
genes) 

Nrg-1 MESGLLRPAPVSEVIVLHYNYTGKLRGARYQPGAGLRADAAVCLAVCAFIVLBNLAVLLV 

EDG-8 MBSGLLRPAPVSEVIVLHYNYTGKLRGARYQPGAGLRADAAVCLAVCAFIVLENLAVLLV 
SPPR MESGLLRPAPVSEVIVLHYNYTGKLRGARYQPGAGLRADAWCLAVCAFIVLENLAVLLV 

******************* 

Nrg-1 LGRHPRFHAPMFLLLGSLTLSDLLAGAAYATNILLSGPLTLRLSPALWFAREGGVFVALA 
EDG-8 LGRHPRFHAPMFLLLGSLTLSDLLAGAAYATNILLSGPLTLRLSPALWFAREGGVFVALA 
SPPR LGRHPRFHAPMFLLLGSLTLSDLLAGAAYAJUSIILLSGPLTLKLSPALWFAREGGVFVALT 
******************************-**********.*****************. 

Nrg-1 ASVLSLLAIAIERHLTMARRGPAPAASRARTLAMAVAAWGLLLTLGLLPALGWNCLGRLE 

EDG-8 asvlsllaialerhltmarrgpapaasrartlamavaawglslllgllpalgwnclgrle: 
SPPR asvlsllaii^ersltmarrgpapvssrgrtlamaaaawgvslllgllpalgwnclgrld 

**********.** **********.************. * **************** 

Nrg-1 ACSTVLPVYAKAYVLFCVLAFLGILAAICMjYARIYCQVRANARRLRAGPGSRRATSSSR 
EDG-8 ACSTVLPLYAKAYVLFCVLAFLGILAAICALYARIYCQVRANARRLRAGPGSRRATSSSR 
SPPR ACSTVLPLYAKAYVLFCVLAFVGILAAICALYARIYCQVRANARRLPARPGT-AGTTSTR 

*******•*************.************************ * **; ^*.*.* 

Nrg-1 SRHTPRSLALLRTLSWLLAFVACWGPLFLLLLLDVACPARACPVLLQADPFLGLAMANS 
EDG-8 SRHTPRSLALLRTLSWLLAFVACWGPLFLLLLLDVACPARACPVLLQADPFLGLAMANS 
SPPR ARRKPRSLALLRTLSWLLAFVACWGPLFLLLLLDVACPARTCPVLLQADPFLGLAMANS 
J*.^*************************************.****************** 

Nrg-1 LLNPIIYTFTNRDLRHALLRLLCCGRGPCNQDSSNSLQRSPSAVGPSGGGLRRCLPPTLD 
EDG-8 LLNPIIYTFTNRDLRHALLRLLCCGRGPCNQDSSNSLQRSPSAVGPSGGGLRRCLPPTLD 
SPPR LLNPIIYTLTNRDLRHALLRLVCCGRHSCOIDPSGSQQ-SASAAEIASGG-LRRCLPPGLD 
********-************.**** . * .******* *** ******* ** 

Nrg-1 RSSSPSEHSCPQRDGMDTSCSTGSPGAATANRTLVPDATD- SEQ. ID #5 

EDG-8 RSSSPSEHSCPQRDGMDTSCSTGSPGAATANRTLVPDATD- SEQ. id #6 

SPPR GSPSGSERSSPQRDGLDTSGSTGSPGAPTAARTLVSEPAAD SEQ. ID #7 
* * *. * ********* ******* ** **** • . 

SPPR: Nrg -85% 
SPPR: EDG -86% 

* - single, fully conserved residue 
: - conservation of strong groups 
. - conservation of weak groups 
- no consensus 
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FIG. 5 

Sphinges ine-l- phosphate receptor. 1 

LOCUS tmpseq 1 1698 bp 30-OCT-2000 

DEFINITION No definition line found. 

ACCESSION tmpseq_.l 

VERSION 

KEYWORDS 

SOURCE Unknown. 
ORGANISM Unknown. 

Unclassified. 
FEATURES Location/Qualifiers 
source 1 . .1698 
CDS 11. .775,11. .775 

/note= "predicted coding region" 

/translation="MESGLLRPAPVSEVIVLHYNYTGKLRGARyQPGAGLRADAWCL 
AVCAFIVLENLAVLLVLGRHPRFHAPMFLLLGSLTLSVPARPGTAGTTSTRARRKPRS 
LALLRTLSVVLLAFVACWGPLFLLLLLDVACPARTCPVLLQADPFLGLAMRNSLLNPI 
lYTLTNRDLRHALLRLVCCGRHSCGRDPSGSQQSASAAEASGGLRRCLPPGLDGSFSG 

SERSSPQRDGLDTSGSTGSPGAPTAARTLVSEPAAD" SEQ. ID #8 

BASE COUNT 352 a 462 c 516 g 368 t 

ORIGIN 

1 cgcgcggccc atggagtcgg ggctgctgcg gccggcgccg gtgagcgagg tcatcgtcct 

61 gcattacaac tacaccggca agctccgcgg tgcgcgctac cagccgggtg ccggcctgcg 

121 cgccgacgcc gtggtgtgcc tggcggtgtg cgccttcatc gtgctagaga atctagccgt 

181 gttgttggtg ctcggacgcc acccgcgctt ccacgctccc atgttcctgc tcctgggcag 

241 cctcacgttg tcggtgccgg cacggcccgg gactgcgggg accacctcga cccgggcgcg 

301 tcgcaagccg cgctcgctgg ccttgctgcg cacgctcagc gtggtgctcc tggcctttgt 

361 ggcatgttgg ggccccctct tcctgctgct gttgctcgac gtggcgtgcc cggcgcgcac 

421 ctgtcctgta ctcctgcagg ccgatccctt cctgggactg gccatggcca actcacttct 

481 gaaccccatc atctacacgc tcaccaaccg cgacctgcgc cacgcgctcc tgcgcctggt 

541 ctgctgcgga cgccactcct gcggcagaga cccgagtggc tcccagcagt cggcgagcgc 

601 ggctgaggct tccgggggcc tgcgccgctg cctgcccccg ggccttgatg ggagcttcag 

661 cggctcggag cgctcatcgc cccagcgcga cgggctggac accagcggct ccacaggcag 

721 ccccggtgca cccacagccg cccggactct ggtatcagaa ccggctgcag actgacaccc 

781 tcggcccacg actgtcttcc caagttttac agacttgttc tttttacata aaggaatttg 

841 taggaaatgc agccaaaggt gcagtcggaa aagatgcagg ggaaatgtat ttatgcagcg 

901 acaccccaca atgtgaacaa acagacaaaa aatctgtgcc ctcgtggaat tgacgttctg 

961 cttgggaaca cagaaaagaa ctcggtgatg aaataatgga gatgattcca gtgacaaacg 

1021 acagagatgg tgatggtggt cagggaagac ctctctgcag aggtagtgac ttgtgatgtg 

1081 agctgagacc tctgtcctgg gaagaccaaa agaaaagcat ttcaggatga gggaatggca 

1141 tgcgcaaagg ccctgaggct gaaatgtgcc catgtgttct aagaaatgca gcgatgctgg 

1201 tgtgcctgga gcagggacgg agggggagaa tgggaggaga caaggagctg aaggagtagt 

1261 tcccgaagga ccttgtgggt gatatagagg acttcgcttt tgctctgagt gaggtgggag 

1321 ccatagaagc ttctaagcag aagagggact tgccctaatt caggtgatca caggtgtctt 

1381 gtggcctcca tgggaggttg aaaaccagag aaggtgaagg ggggctgcac tgagccacag 

1441 gaacaatgat ggagattcca gctaagccca gaccccgtgg attctagata gattttagag 

1501 gcagcagaca gaattactga ggaattgagt gtaagagtgg aataaagtta tcaaggacaa 

1561 tgccaagggt ggggcacccc caaatttgac tctgggagac tcagccaaat cctatctggt 

1621 aataaaattt cttttttatt tttcttttct ttctttcttt cttttttttt tttttgagtt 

1681 gggatcttgt gctctgtc SEQ. ID #9 

SIP KES6IJiRPAFVSEVr\riiR7NTTGKLRGAK7QFGAGLRADAVV^^ 

S 1 PI MBSGIJiRPAPVSEVIVLHTNYTGKLRGARTQPGAGLRADAWCIAVCAPI^^ 

SIP L6RHPRFHAH&dFIiLLGSLTLSDLLAGAAYAANILLSGPLTLKLSPALWFAREGGVFVALT 

SlPl LGRHPRFHAPtdFLLLGSLTLS 

********************* 
SIP ASVLSLLAIALERSLTMARRGPAPVSSRGRTLAMAAAAWGVSLLLGLLPALGWNCLGRLD 
SlPl 

SIP ACSTVLPLYAKAYVLPCVIiAFVGIIAAICALYARIYCQVRANARRLPARPGTAGTTSTRA 

SlPl VPARPGTA6TTSTRA 

: ************** 

SIP RBKFRSZAIJiRTLSVVLIJ^FVACMGFLFLX:iIJiIJ>VAC^ 
SlPl BraCPRSZAIJiRTLSVVZJAFVACRGPLFLIJXZJ^ 

************************************************************ 
SIP LNFZIYTLTNBDIiRKAZiIiRLVCCeRH 

SlPl IiNFIZ7TLTNia)Ii{UU^ia«VCCGRHSC6RDFSGSQQ3ASAA£ASG6IJtKCLPFGIiDGSF 

******************************************************* 4r**** 

SIP SGSBRSSPQRDGIiDTSGSTGSPGAPTAARTLVSEPAAD SEQ. ID #10 

SlPl SGSERSSPQBDGIiDTSGSTGSPGAFTAARTLVSEPAAD SEQ. ID #11 

************************************** 



SIP« Sphingosine-1- Phosphate receptor versus 
SIPlaSphingosine-l-Phosphate 1 receptor 
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FIG. 6 



Sphingosine -1-Phosphate receptor 2 



LOGOS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 

FEATURES 
source 
CDS 



tinpseq_^l 1245 bp 

No definition line found. 

tinpseq_l 



30-OCT-2000 



Unknown . 
Unknown . 
Unclassified- 
Location/Qualifiers 

I. .1245 

II. .322,11. .322 

/note="predicted coding region" 

/translation="MESGLLRPAPVSEVIVLHYNYTGKLRGARYQPGAGLRADAWCL 
AVCAFIVLENLAVLLVLGRHPRFHAPMFLLLGSLTLSDLLAGAAYAAAARTLVSEPAA- 



D" 

BASE COUNT 298 c 
ORIGIN 

1 cgcgcggccc 
gcattacaac 
cgccgacgcc 
gttgttggtg 
cctcacgttg 
atcagaaccg 



284 c 



372 g 



291 t 



SEQ. ID #12 



61 
121 
181 
241 
301 
361 
421 
481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 



atggagtcgg ggctgctgcg 
tacaccggca agctccgcgg 
gtggtgtgcc tggcggtgtg 
ctcggacgcc acccgcgctt 
tcggatctgc tggcaggcgc 



gccggcgccg gtgagcgagg tcatcgtcct 
tgcgcgctac cagccgggtg ccggcctgcg 
cgccttcatc gtgctagaga atctagccgt 
ccacgctccc atgttcctgc tcctgggcag 
cgcctacgcc gccgccgccc ggactctggt 



gctgcagact gacaccctcg gcccacgact gtcttcccaa gttttacaga 



cttgttcttt ttacataaag gaatttgtag gaaatgcagc 
atgcagggga aatgtattta tgcagcgaca ccccacaatg 
gtggaattga cgttctgctt 



ctgtgccctc 
taatggagat 
tctgcagagg 
aaagcatttc 
gtgttctaag 
gaggagacaa 
tcgcttttgc 
cctaattcag 
gtgaaggggg 
cccgtggatt 
agagtggaat 
gggagactca 
tttctttctt 



gattccagtg 
tagtgacttg 
aggatgaggg 
aaatgcagcg 
ggagctgaag 
tctgagtgag 
gtgatcacag 
gctgcactga 
ctagatagat 
aaagttatca 
gccaaatcct 
tttttttttt 



acaaacgaca 
tgatgtgagc 
aatggcatgc 
atgctggtgt 
gagtagttcc 
gtgggagcca 
gtgtcttgtg 
gccacaggaa 



gggaacacag 
gagatggtga 
tgagacctct 
gcaaaggccc 
gcctggagca 
cgaaggacct 
tagaagcttc 
gcctccatgg 
caatgatgga 



tttagaggca gcagacagaa 
aggacaatgc caagggtggg 
atctggtaat aaaatttctt 
ttgagttggg atcttgtgct 



caaaggtgca 
tgaacaaaca 
aaaagaactc 
tggtggtcag 
gtcctgggaa 
tgaggctgaa 
gggacggagg 
tgtgggtgat 
taagcagaag 
gaggttgaaa 
gattccagct 



gtcggaaaag 
gacaaaaaat 
ggtgatgaaa 
ggaagacctc 
gaccaaaaga 
atgtgcccat 
gggagaatgg 
atagaggact 
agggacttgc 
accagagaag 
aagcccagac 



ttactgagga attgagtgta 
gcacccccaa atttgactct 
ttttattttt cttttctttc 
ctgtc SEQ. ID #13 



SIP MESGLLRPAPVSEVIVI.HYNYTGKIiRGARyQPGAGLRADAWCLAVCAFIVIiENIJl^ 
S I P2 MESGUiRPAPVSEVIVLHYNYTGKWGARYQPGAGIJ^AWCLAVCi^IV^ 



SIP LGRHPRFHAPMFLLLGSLTLSDXJJlGJkAYAANILLSGPLTLKLSPALWFAREGGVFVALT 
SIP2 LGRHPRPHAEb4PIJ.I.GSI.TLSDlilAGAAYAA 

it-k-kit-k-k-k'kitirit'kic'k'kiiit'k-k-k'k'klr'k^e'kitic'kieit 
SIP AS VLSLLAI ALERSLTMARRGPAPVSSRGRTLAMAAAAWGVSLLLGLLPALGWNCLGRLD 
SIP2 



SIP ACSTVLPLYABCAYVLFCVLAFVGILAAICALYARI YCQVRANARRLPARPGTAGTTSTRA 
SIP2 

SIP RRKPRSLALLRTLSVVLLAFVACWGPLFLLLLLDVACPARTCPVLLQADPFLGLAMANSL 

SIP2 

SIP LNPIIYTLTNRDLRHALLRLVCCGRHSCGRDPSGSQQSASAAEASGGLRRCLPPGLDGSF 

SIP2 

SIP SGSERSSPQRDGLDTSGSTGSPGAPTAARTLVSEPAAD 

SIP2 AARTLVSEPAM SEQ. ID #14 
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FIG. 8 

1 gcgcggcccatggagtcggggctgctgcggccggcgccggtgagcgaggtcatcgtcctg 

MESGLLRPAPVSEVIVL 17 
61 cattacaactacaccggcaagctccgcggtgcgcgctaccagccgggtgccggcctgcgc 

HYNYTGKLRGARYQPGAGLR 37 
121 gccgacgccgtggtgtgcctggcggtgtgcgccttcatcgtgctagagaatctagccgtg 

A D AVVCLAVCAFIVLENLAV 57 
181 ttgttggtgctcggacgccacccgcgcttccacgctcccatgttcctgctcctgggcagc 

L L V LGRHPRFHAPMF L L L G S 77 
241 ctcacgttgtcggatctgctggcaggcgccgcctacgccgccaacatcctactgtcgggg 

LTLSDLLAGAAYAANILLSG 97 
301 ccgctcacgctgaaactgtcccccgcgctctggttcgcacgggagggaggcgtcttcgtg 

PLTLKLSPALWFAREGGVFV 117 
361 gcactcactgcgtccgtgctgagcctcctggccatcgcgctggagcgcagcctcaccatg 

ALTASVLSLLAIAL E R S L T M 137 
421 gcgcgcagggggcccgcgcccgtctccagtcgggggcgcacgctggcgatggcagccgcg 

ARRGPAPVSSRGRT L A M A A A 157 
481 gcctggggcgtgtcgctgctcctcgggctcctgccagcgctgggctggaattgcctgggt 

AWGV5LLLGLLPAL G W N C L G 177 
541 cgcctggacgcttgctccactgtcttgccgctctacgccaaggcctacgtgctcttctgc 

RLDACS TVLPLYAKAY V L F C 197 
601 gtgctcgccttcgtgggcatcctggccgcgatctgtgcactctacgcgcgcatctactgc 

VLAFVGILAAICALYA R I Y C 217 
661 caggtacgcgccaacgcgcggcgcctgccggcacggcccgggactgcggggaccacctcg 

QVRANARRLPARPGTAGT TS 237 
721 acccgggcgcgtcgcaagccgcgctcgctggccttgctgcgcacgctcagcgtggtgctc 

TRARRKPRSLALLRTLSVVL 257 
781 ctggcctttgtggcatgttggggccccctcttcctgctgctgttgctcgacgtggcgtgc 

LAFVACWGPLFLLLLL D V A C 277 
841 ccggcgcgcacctgtcctgtactcctgcaggccgatcccttcctgggactggccatggcc 

P A RTC PVLLQADPFLGLAMA 297 
901 aactcacttctgaaccccatcatctacacgctcaccaaccgcgacctgcgccacgcgctc 

NSLIiNPII YTLTNRDLRHAL 317 
961 ctgcgcctggtctgctgcggacgccactcctgcggcagagacccgagtggctcccagcag 

LRLVCCGRHSCGRDPSGSQQ 337 
1021 tcggcgagcgcggctgaggcttccgggggcctgcgccgctgcctgcccccgggccttgat 

SASAAEASGGLRRCLPPGLD 357 
1081 gggagcttcagcggctcggagcgctcatcgccccagcgcgacgggctggacaccagcggc 

GSFSGSERSSPQRDGLDTSG 377 
1141 tccacaggcagccccggtgcacccacagccgcccggactctggtatcagaaccggctgca 

STGSPGAPTAARTLVSEPAA 397 
1201 gactgacaccctcggcccacgactgtcttcccaagttttacagacttgttctttttacat 

D * .398 
1261 aaaggaatttgtaggaaatgcagccaaaggtgcagtcggaaaagatgcaggggaaatgta 
1321 tttatgcagcgacaccccacaatgtgaacaaacagacaaaaaatctgtgccctcgtggaa 
1381 ttgacgttctgcttgggaacacagaaaagaactcggtgatgaaataatggagatgattcc 
1441 agt gacaaacgacagaga tggt gatggt ggt cagggaagacct ct ctgcagaggtagt ga 
1501 cttgtgatgtgagctgagacctctgtcctgggaagaccaaaagaaaagcatttcaggatg 
1561 agggaatggcatgcgcaaaggccctgaggctgaaatgtgcccatgtgttctaagaaatgc 
1621 agcgatgctggtgtgcctggagcagggacggagggggagaatgggaggagacaaggagct 
1681 gaaggagtagttcccgaaggaccttgtgggt gat at agaggactt cgctt ttgctctgag 
1741 tgaggtgggagccatagaagcttctaagcagaagagggacttgccctaattcaggtgatc 
1801 acaggtgtcttgtggcctccatgggaggttgaaaaccacagaaggtgaaggggggctgca 
18 61 ctgagccacaggaacaatgatggagattccagctaagcccagaccccgtggattctagat 
1921 agattttagaggcagcagacagaattactgaggaattgagtgtaagagtggaataaagtt 
1981 atcaaggacaatgccaagggtggggcacccccaaatttgactttgggagactcagccaaa 
2041 tcctatctggtaataaaatttcttttttatttttcttttctttctttctttctttctttc 
2101 tttttttttttttgagttgggatcttgtgctctgtcacccaggctggagtgcaatgggca 
2161 caattatagctcactgcagcctggaactcctgggatcaagcctggagttcctgcttcagc 
2221 ctccctagtagctgggactacaggcatgcaccaccatgcccagttaataaaatttcttca 
2281 aatgcaaaaaaaaaaaaaaaaaaaaa 
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FIG. 9 

Nrg-l MESGLLRPAPVSEVIVLHYNYTGKLRGARYQPGAGLRADAAVCLAVCAFIVLENLAVLLV 
EDG-8 MESGLLRPAPVSEVIVLHYNYTGKLRGARYQPGAGLRADAAVCLAVCAFIVLENIiAVLLV 
S1P5 MESGLLRPAPVSEVIVLHYNYTGKLRGARYQPGAGLRADAWCLAVCAFIVLENLAVLLV 
************* ******************* 

Nrg-l LGRHPRFHAPMFLLLGSLTLSDLLAGAAYATNILLSGPLTLRLSPALWFAREGGVFVALA 
EDG-8 LGRHPRFHAPMFLLLGSLTLSDLLAGAAYATNILLSGPLTLRLSPALWFAREGGVFVALA 
SIP5 LGRHPRFHAPMFLLLGSLTLSDIiLAGAAYAANILLSGPLTLKLSPALWFAREGGVFVALT 

Nr g- 1 AS VLSLLAI AI ERHLTMARRGPAPAASRARTLAMAVAAWGLLLTLGLLPALGWNCLGRLE 

EDG-8 ASVLSLLAIALERHLTMARRGPAPAASRARTLAMAVAAWGLSLLLGLLPALGWNCLGRLE 
Slpg ASVLSLLAIALERSLTMARRGPAPVSSRGRTLAMAAAAWGVSLLLGLLPALGWNCLGRLD 

4e*it'k'kicififk'k , icit irieir-k -k-k-k *-k-k ^ > * -k ^ -k -k •kieif -k ^ it-kif . ^ iek-k-k-kk-k-k-k-k-kk-k-k -k * 

Nrg-l ACSTVIiPVYAKAYVLFCVLAFLGILAAICALYARIYCQVRANARRLRAGPGSRRATSSSR 
EDG-8 ACSTVLPLYAKAYVLFCVLAFLGILAAICALYARIYCQVRANARRLRAGPGSRRATSSSR 
SIP5 ACSTVLPLYAKAYVLFCVLAFVGILAAICALYARIYCQVRANARRLPARPGT-AGTTSTR 

•k'k-kkk'k-k»irk'k-kit'k'k'k-k-kitit'kiir'k'kk-k'kirk'k-k**'k*it*'k'k'k*'k'kir'k * k-k » 

Nr g- 1 SRHTPRSLALLRTLSVVLLAFVACWGPLFLLLLLDVACPARACPVLLQADPFLGLAN^ 

EDG-8 SRHTPRSLALLRTLSWLLAFVACWGPLFLLLLLDVACPARACPVLLQADPFLGLAMANS 
S 1 P5 ARRKPRSLALLRTLSWXiLAFVACWGPLFLLLLLDVACPARTCPVLLQADPFLGLAMANS 

Nrg-l LLNPIIYTFTNRDLRHALLRLLCCGRGPCNQDSSNSLQRSPSAVGPSGGGLRRCLPPTLD 

EDG-8 LLNPIIYTFTNRDLRHALLRLLCCGRGPCNQDSSNSLQRSPSAVGPSGGGIiRRCLPPTLD 

SIP5 LLNPIIYTLTNRDLRHALLRLVCCGRHSCGRDPSGSQQ-SASAAEASGG-LRRCLPPGLD 

* -k^kk^ ^-k-k-k ******* ** 

Nrg-l RSSSPSEHSCPQRDGMDTSCSTGSPGAATANRTLVPDATD- 
EDG-8 RSSSPSEHSCPQRDGMDTSCSTGSPGAATANRTLVPDATD- 

S 1 P5 GS FSGSERS S PQRDGLDTSGS TGS PGAPTAARTLVSEP7\AD 
* * *. *^*****>*** *******^** ****^.^. 

SIP5: Nrg -85% 

S1P5: EDG -8 6% 

* - single, fully conserved residue 
: - conservation of strong groups 
. - conservation of weak groups 
- no consensus 
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FIG. 12A 



SIP5 MESGLLRPAPVSEVIVLHYNYTGKLRGARYQPGAGLRADAVyCLAVCAFIVLENLAVLLV 

SIP5- a MESGLLRPAPVSEVIVLHYNYTGKLRGARYQPGAGLRADAVVCLAVCAFIVLENLAVLLV 
************************************************************ 

SIP5 LGRHPRFHAPMFLLLGSLTLSDLLAGAAYAANILLSGPLTLKLSPALWFAREGGVFVALT 

SlP5-a LGRHPRFHAPMFLLLGSLTLS 

********************* 

SIP 

'''^''5 ASVLSLLAIALERSLTMARRGPAPVSSRGRTLAMAAAAWGVSLLLGLLPALGWNCLGRLD 

slPg-a 

SIP5 ACSTVLPLYAKAYVLFCVLAFVGILAAICALYARIYCQVRANARRLPARPGTAGTTSTRA 

SlP^-a VPARPGTAGTTSTRA 

.************** 

^ ^5 RRKPRSLALLRTLSVVLIAFVACWGPLFLLLLLDVACPARTCPVLLQADPFLGIAMANSL 

SIP5- a RRKPRSLALLRTLSWLLAFVACWGPLFLLLLLDVACPARTCPVLLQADPFLGLAMANSL 
***************************** *****************************jt* 

^■^^5 LNPIIYTLTNRDLRHALLRLVCCGRHSCGRDPSGSQQSASAAEASGGLRRCLPPGLDGSF 

SlPg-a LNPIIYTLTNRDLRHALLRLVCCGRHSCGRDPSGSQQSASAAEASGGLRRCLPPGLDGSF 
************************************************************ 

^^^5 SGSERSSPQRDGLDTSGSTGSPGAPTAARTLVSEPAAD 

SlP^-a SGSERSSPQRDGLDTSGSTGSPGAPTAARTLVSEPAAD 
************************************** 



FIG. 12B 

SIP5 MESGLLRPAPVSEVIVLHYNYTGBCLRGARYQPGAGLRADAWCLAVCAFIVLENLAVLLV 

SIP5- P MESGLLRPAPVSEVIVLHYNYTGKLRGARYQPGAGLRADAWCLAVCAFIVLENLAVLLV 
*******************************************************^**** 

SIP5 LGRHPRFHAPMFLLLGSLTLSDLLAGAAYAANILLSGPLTLKLSPALWFAREGGVFVALT 

SIPR-P LGRHPRFHAPMFLLLGSLTLSDLLAGAAYAA 

******************************* 

SIP5 ASVLSLLAIALERSLTMARRGPAPVSSRGRTLAMAAAAWGVSLLLGLLPALGWNCLGRLD 
SIP3-P 

SIP3 ACSTVLPLYAKAYVLFCVLAFVGILAAICALYARIYCQVRANTUIRLPARPGTAGTTSTRA 
SIP3-P 

SIP5 RRKPRSLALLRTLSWLLAFVACWGPLFLLLLLDVACPARTCPVLLQADPFLGLAMANSL 

SIP5-P 

SIP5 LNPIIYTLTNRDLRHALLRLVCCGRHSCGRDPSGSQQSASAAEASGGLRRCLPPGLDGSF 



SIP5 SGSERSSPQRDGLDTSGSTGSPGAPTAARTLVSEPAAD 
AARTLVSEPAAD 
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